Accounting for Stability of Retrieval Algorithms using Risk-Reward Curves
نویسنده
چکیده
Past evaluation of information retrieval algorithms has focused largely on achieving good average performance, without much regard for the stability or variance of retrieval results across queries. In fact, two algorithms that superficially appear to have equally desirable average precision performance can have very different stability or risk profiles. A prime example comes from query expansion, where current techniques typically give good average improvements in mean average precision, but are also unstable and have high variance across individual queries [3]. We propose the use of risk-reward curves and related statistics to characterize the tradeoff an algorithm exhibits between a reward property such as mean average precision and a risk property such as the variance of the algorithm – particularly the downside variance, when the algorithm fails or makes performance worse. Such evaluation methods are broadly applicable beyond query expansion to other retrieval operations that must balance risk and reward, such as personalization, document ranking, resource selection, and others.
منابع مشابه
Image retrieval using the combination of text-based and content-based algorithms
Image retrieval is an important research field which has received great attention in the last decades. In this paper, we present an approach for the image retrieval based on the combination of text-based and content-based features. For text-based features, keywords and for content-based features, color and texture features have been used. Query in this system contains some keywords and an input...
متن کاملارزیابی و مقایسه توان مدلهای مبتنی بر شاخصهای حسابداری ریسک و بتای پاداشی در پیشبینی بازده سهام
چگونگی اندازه گیری و دخیل نمودن ریسک، یکی از مباحث چالش برانگیز در مدلهای ارزشیابی سهام میباشد. در این مقاله اثربخشی دو روش متفاوت از اندازه گیری ریسک مورد مقایسه قرار گرفته است. در روش اول بر مبنای مدل شاخصهای حسابداری ریسک، کوواریانس خصوصیات بنیادی شرکت از جمله سود حسابداری و بازده مازاد حقوق صاحبان سهام با عوامل بازار مربوطه به عنوان تعدیل ریسک در مدل ارزشیابی وارد گردیده و با ارزش فعلی ب...
متن کاملبهبود پایداری شبکه قدرت با روش جدید حذف بار ترکیبی
Power system blackouts have become a serious problem for electric utilities especially in recent years. Different forms of system instability have emerged in recent blackouts, such as voltage instability and frequency instability. To counteract each form of system instability, special algorithms have been designed in the protection system, e.g. Under Frequency Load Shedding (UFLS) and Under Vol...
متن کاملFinancial Reporting Fraud Detection: An Analysis of Data Mining Algorithms
In the last decade, high profile financial frauds committed by large companies in both developed and developing countries were discovered and reported. This study compares the performance of five popular statistical and machine learning models in detecting financial statement fraud. The research objects are companies which experienced both fraudulent and non-fraudulent financial statements betw...
متن کاملStock Portfolio Optimization Using Water Cycle Algorithm (Comparative Approach)
Portfolio selection process is a subject focused by many researchers. Various criteria involved in this process have undergone alterations over time, necessitating the use of appropriate investment decision support tools. An optimization approach used in different sciences is using meta-heuristic algorithms. In the present study, using Water Cycle Algorithm (WCA), a model was introduced for sel...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
عنوان ژورنال:
دوره شماره
صفحات -
تاریخ انتشار 2009